Systems, processes, and computer program products for detecting and verifying invisible information in documents

ABSTRACT

Embodiments identify documents with invisible or hidden information such as tracking codes and extract the tracking codes, which may be leveraged for different applications. Aspects may protect an individual&#39;s right to privacy by extracting “invisible” codes from a document. Other aspects preserve the authenticity of documents by converting the information of an extracted tracking code into other metadata which may accompany the document and copies of the document. In an embodiment, a blockchain ledger may be used to verify the authenticity of a document using information from the original tracking code.

CROSS-REFERENCE TO RELATED APPLICATION

None.

FIELD

The subject disclosure relates to networks and more particularly tosystems, processes, and computer program products for detecting andverifying invisible information in documents and for authentication ofdocuments.

BACKGROUND

Different brands of color laser printers add invisible or hiddeninformation to documents. In one form, the information is printed asvisually imperceptible yellow dots. The barely perceptible yellow dotscan be used to trace a printed document to its source: the date and timeit was printed, and the serial number of the exact printer on which itwas printed. The feature was originally intended as a deterrent tocounterfeiting currency with laser printers.

The millimeter-sized dots appear about every inch on a page, nestledwithin the printed words and margins. The dots' minuscule size, coveringless than one-thousandth of the page, along with their color combinationof yellow on white, makes them invisible to the naked eye.

SUMMARY

In one aspect of the disclosure, a method of protecting the authenticityof documents is disclosed. The method comprises: receiving a document,including invisible printed information, by an optical scanning device;scanning the document; capturing an image of content in the scanneddocument; identifying the invisible printed information in the capturedimage of the scanned document; generating metadata representing theidentified invisible printed information; extracting the invisibleprinted information from the captured image of the scanned document;converting the captured image of the scanned document into an electroniccopy of the received document, without the invisible printed informationbeing present in the electronic copy; and attaching the generatedmetadata to the converted electronic copy of the received document.

In another aspect of the disclosure, a computer program product toprotect the authenticity of documents is disclosed. The computer programproduct comprises a non-transitory computer readable storage mediumhaving computer readable program code embodied therewith. The computerreadable program code is configured, when executed by a processor, to:receive a document, including invisible printed information, by anoptical scanning device; scan the document; capture an image of contentin the scanned document; identify the invisible printed information inthe captured image of the scanned document; generate metadatarepresenting the identified invisible printed information; extract theinvisible printed information from the captured image of the scanneddocument; convert the captured image of the scanned document into anelectronic copy of the received document, without the invisible printedinformation being present in the electronic copy; and attach thegenerated metadata to the converted electronic copy of the receiveddocument.

In yet another aspect, a computing apparatus is disclosed. The computingapparatus, comprises an optical scanner module; a memory storage unit;and a processing unit coupled to the optical scanner module and to thememory storage unit, wherein computer readable program code in thememory storage unit is configured to protect the authenticity ofdocuments, by controlling the processing unit to: receive a document,including invisible printed information, by the optical scanner module;scan the document; capture an image of content in the scanned document;identify the invisible printed information in the captured image of thescanned document; generate metadata representing the identifiedinvisible printed information; extract the invisible printed informationfrom the captured image of the scanned document; convert the capturedimage of the scanned document into an electronic copy of the receiveddocument, without the invisible printed information being present in theelectronic copy; and attach the generated metadata to the convertedelectronic copy of the received document.

It is understood that other configurations of the subject technologywill become readily apparent to those skilled in the art from thefollowing detailed description, wherein various configurations of thesubject technology are shown and described by way of illustration. Aswill be realized, the subject technology is capable of other anddifferent configurations and its several details are capable ofmodification in various other respects, all without departing from thescope of the subject technology. Accordingly, the drawings and detaileddescription are to be regarded as illustrative in nature and not asrestrictive.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a system flow diagram of a method for determining theauthenticity of documents through extracted information in accordancewith an embodiment of the subject technology.

FIG. 2 is a flowchart for a method of capturing tracking codes in aprinted document in accordance with embodiments of the subjecttechnology.

FIG. 3 is a block diagram of a multifunction printer system inaccordance with embodiments of the subject technology in accordance withembodiments of the subject technology.

FIG. 4 is a block diagram of a scanner system in accordance with anembodiment of the subject technology.

FIG. 5 is a block diagram of a distributed blockchain ledger inaccordance with an embodiment of the subject technology.

FIG. 6 is a flowchart of a method of authenticating access to a documentin accordance with an embodiment of the subject technology.

FIG. 7 is a block diagram of a computing device in accordance with anaspect of the subject technology.

DETAILED DESCRIPTION

The detailed description set forth below is intended as a description ofvarious configurations of the subject technology and is not intended torepresent the only configurations in which the subject technology may bepracticed. The appended drawings are incorporated herein and constitutea part of the detailed description. The detailed description includesspecific details for the purpose of providing a thorough understandingof the subject technology. However, it will be apparent to those skilledin the art that the subject technology may be practiced without thesespecific details. Like or similar components are labeled with identicalelement numbers for ease of understanding.

As will be appreciated, embodiments disclosed below may be practiced inand have industrial applicability to the general fields of documentscanning and document authentication.

In general, as used below, a “document” may refer to a sheet of paper ormultiple sheets that will be assembled into an overall publication. A“document” may also refer to an electronic copy of a paper where anelectronic copy is not specifically referenced.

Generally, embodiments of the subject technology provide identificationof documents with information embedded in the document that may beinvisible to the naked eye. Invisible information may include forexample, microdots, barcodes, text/graphics printed in grayscale orcolor imperceptible to the human eye, or text/graphics in fontimperceptible to the human eye.

The embodiments disclosed identify the presence of invisible or hiddeninformation printed onto the document and extract the invisibleinformation from an electronic copy of the document. Information fromextracted tracking codes, for example, may be leveraged for differentapplications.

As will be appreciated, aspects of the disclosed embodiments provideseveral benefits. Aspects may protect an individual's rights byextracting these “invisible” codes from a document. For example, whilemicrodots printed onto a document were useful to identify a suspect wholeaked a top-secret classified document, the tracking informationassociated with the dots may violate an individual's right to privacy.

From a business perspective, numerous previously printed documents needto be scanned/captured and integrated into DMS/ECM systems. Preservingthe fidelity of the original document is a challenge due to limited scanquality. In this case, those microdots may not be captured properly, andthe document may lose its tracking identity. Aspects may preserve thecontent of the hidden information by converting the document into a modefor better accuracy of optical recognition. In addition, someembodiments convert the hidden information into another form ofinformation that can be more easily read.

Other aspects preserve the authenticity of documents by converting theinformation of an extracted tracking code into other metadata which mayaccompany the document and copies of the document. In an embodiment, ablockchain ledger may be used to verify the authenticity of a documentusing information from the original tracking code. Embedded software maygenerate metadata with tracking information and may integrate metadatainto the scanned document. Later the metadata can be used by a DMS/ECMsystem for its classification, workflow, archiving process. The goal isto retain document history and original identity. This may be veryuseful for the administration of or access to copies of legal/sensitivedocuments including for example, social security numbers, birthcertificates, contracts, legal agreements, etc.

Some embodiments may capture documents, preserve the information as partof a document's history and authenticate/validate the user informationvia blockchain for security and privacy. As may be appreciated, someaspects enhance ECM with blockchain concepts that can serve both userand governance needs. The blockchain-enhanced enterprise contentplatform may provide a secure and compliant system that protects data,respects individual privacy and empowers companies by creatinganonymized, verified and self-managed ‘profiles’ for employees,customers and suppliers.

In addition, some aspects are useful for identifying forgeries. Thetracking code can be integrated into a blockchain (distributed ledger)as unique identity, especially for confidential documents (legalagreements that signed by different parties). Forgeries may beidentified when the stored metadata of a tracking code does not matchinformation in a presented document.

Definitions

MFP—Multi Function Printer

A multifunction printer (MFP) is a device that consolidates thefunctionality of a printer, copier, scanner and/or fax into one machine.

Embedded software—A software running in a device (Printer/MFP)

Embedded software is computer software, written to control machines ordevices that are not typically thought of as computers, commonly knownas embedded systems. It is typically specialized for the particularhardware that it runs on and has time and memory constraints.

Capture—the process of scanning paper documents

Document capture is any one of several processes used to convert aphysical document to another format, typically a digital representation.At its simplest, document capture involves scanning a physical documentand saving it as a digital image. However, in the context of enterpriseinformation management, creating a digital image file is often notadequate for business purposes. For text documents, capture usuallyincludes processes like optical character recognition (OCR), so that theinformation contained in the document can be accessed and integratedwith an organization's information systems.

OCR—Optical Character Recognition

OCR is the recognition of printed or written text characters by asoftware. This involves photoscanning of the textcharacter-by-character, analysis of the scanned-in image, and thentranslation of the character image into character codes, such as ASCII,commonly used in data processing. In OCR processing, the scanned-inimage or bitmap is analyzed for light and dark areas in order toidentify each alphabetic letter or numeric digit. When a character isrecognized, it is converted into an ASCII code. Special circuit boardsand computer chips designed expressly for OCR are used to speed up therecognition process.

Metadata

Metadata is structured information that describes, explains, locates, orotherwise makes it easier to retrieve, use, or manage an informationresource. Metadata is often called data about data or information aboutinformation.

There are three main types of metadata:

Descriptive metadata describes a resource for purposes such as discoveryand identification. It can include elements such as title, abstract,author, and keywords.

Structural metadata indicates how compound objects are put together, forexample, how pages are ordered to form chapters.

Administrative metadata provides information to help manage a resource,such as when and how it was created, file type and other technicalinformation, and who can access it. There are several subsets ofadministrative data:

Rights management metadata, which deals with intellectual propertyrights

Preservation metadata, which contains information needed to archive andpreserve a resource.

In this disclosure, administrative metadata can be used to track oridentify forgery.

DMS—Document Management System

A document management system is a system used to track, manage and storedocuments and reduce paper. Most are capable of keeping a record of thevarious versions created and modified by different users. Documentmanagement systems commonly provide storage, versioning, metadata,security, as well as indexing and retrieval capabilities. The DMS mayalso extract metadata from the document automatically or prompt the userto add metadata in some of the steps described below.

ECM—Enterprise Content Management

Enterprise content management extends the concept of content managementby adding a timeline for each content item and possibly enforcingprocesses for the creation, approval and distribution of them.

Enterprise content management, as a form of content management, combinesthe capture, search and networking of documents with digital archiving,document management and workflow. It specifically includes the specialchallenges involved in using and preserving a company's internal, oftenunstructured information, in all of its forms.

Distributed Ledger

A distributed ledger may be a database that is consensually shared andsynchronized across network spread across multiple sites, institutionsor geographies. The distributed ledger database may be spread acrossseveral nodes (devices) on a peer-to-peer network, where each replicatesand saves an identical copy of the ledger and updates itselfindependently. In some embodiments, there is an advantage to a lack ofcentral authority. When a ledger update happens, each node may constructthe new transaction, and then the nodes vote by consensus algorithm onwhich copy is correct. Once a consensus has been determined, all theother nodes update themselves with the new, correct copy of the ledger.Security may be accomplished through cryptographic keys and signatures.

Blockchain

A blockchain may be a ledger that records a history of transactionsbetween all the users since its creation. It permits the storage andtransmission of information. The database is distributed, which meansthe transactions are validated by a consensus process and shared to allthe participants (Peer-to-peer). It allows counterparties to makesecured direct transactions without a central intermediary. Trust isensured through a mutual consensus verification protocol. The set ofrules and procedures for authenticating and validating the informationare added to the distributed ledger by the members of the network (allthe members or a majority) and not by a central authority.

Embodiments of the Subject Technology

For the following flowcharts, steps of the processes may be referred toas blocks. As will be described further below, steps may be performed bya machine, for example, a computing device, an image forming apparatus,and/or a processing unit. In general, the steps may be coordinatedthrough a server(s) connected to the blockchain. While the steps may bedescribed in one order, it will be understood that variations to theorder may be practiced or the order in which claims below are writtenmay vary from the following without departing from the scope of theinvention.

Referring now to FIG. 1, a method 100 of protecting the authenticity ofdocuments is shown according to an exemplary embodiment. FIG. 1 shows asequence diagram of steps that may occur between elements in a systemembodiment. The system elements may include a multi-function printer(MFP) 110 with a printer panel, an embedded optical characterrecognition (OCR) module 120, an embedded software application 130, adocument management system (DMS) or enterprise content management system(ECM) 140, and an enterprise resource planning (ERP) orcustomer-relationship management (CRM) system 150.

In block 160, a document may be received by the MFP 110. The MFP 110 mayinclude a scanner unit as will be described in further detail below inFIG. 3 and FIG. 4. The scanner unit may start a scan process of thedocument for text and graphics forming one or more images. The document(or its image) may be captured and transferred 162 to the OCR module120. The OCR module 120 may create 164 a searchable version of thescanned document (for example, as a PDF document of the transferredimage(s)). The scanned document may be forwarded 166 to the softwareapplication 130. The software application 130 may identify 168 anyinvisible or hidden information (for example, a tracking code) printedinto the originally received document from the image of the scanneddocument. In an exemplary embodiment, identified invisible printedinformation may be extracted from the scanned document. In someembodiments, confirmation of the information being extracted may be sent170 to the MFP 110. Some embodiments may include a query sent 172 to theuser asking if the invisible information may be extracted from thescanned document. The query may be displayed on the printer panel andinput from the user may be received through the printer panel. Aconfirmation message from the user may be sent 174 back to the softwareapplication. The invisible information may be removed (extracted) 176from the scanned document. In an exemplary embodiment, metadatarepresenting the content of invisible information that was removed maybe generated and saved as a separate file. The scanned document with thecaptured image may be converted into an electronic copy of the receiveddocument, without the invisible printed information being present in theelectronic copy. The MFP 110 may complete 178 the scan operation.

In some embodiments, the method 100 may further include anauthentication process which may authenticate documents or access todocuments. The electronic copy and the generated metadata of theinvisible information may be sent 180 to the DMS/ECM system 140. Thegenerated metadata representing the invisible information may beembedded 182 by the DMS/ECM system 140 into the converted electroniccopy to retain history of the document and the original identity of thedocument. For example, the information may indicate the originalcreation of the document, its source, and a hash of its original statebefore any modifications. A copy of the electronic copy with themetadata may be sent 184 for archival to a database. The database may befor example, a distributed ledger. In some embodiments, the ERP/CRMsystem 150 may comprise a distributed ledger for administeringtransactions involving authentication of documents or access todocuments. The copy of the electronic copy with the metadata may bestored 186 in the ERP/CRM system 150 for authentication processing andarchiving.

For example, in the context of checking 188 the authenticity of adocument (which may include checking the document for modificationssince its creation), the process may include extracting invisibleinformation and generating metadata from the extracted information of adocument being authenticated per the steps described above. The metadataof the document being authenticated may be compared 190 to the storedmetadata of the original document. The comparison may be performed forexample, by the DMS/ECM system 140. Confirmation that the metadata ofthe two documents match may be sent 192 to the ERP/CRM system 150. Inthe alternative, if the metadata being compared do not match, forgeriesof the document may be identified, and an alert may be sent 194 to anadministrator indicating a document being used in a transaction is notauthentic.

In another embodiment, access to an authentic document may use theextracted information to authenticate authorization to the document. Forexample, the metadata may also be used for authenticating an access tosensitive documents. In an exemplary application, school administrationwould receive an authentication code to access an authenticated copy ofa student's birth certificate stored in the DMS/ECM system 140 insteadof requiring the student to provide an original copy. The authenticationcode (password) may be generated via SHA256 hash of metadata (trackingcode information—time&serial #) which was attached to the authenticatedcopy of the certificate. The school administration may receive throughan encrypted transaction a copy of the access code provided by theDMS/ECM system 140. Upon input of the access code, the DMS/ECM system140 may compare the metadata from the input code to a copy of themetadata stored in the ERP/CRM system 150 to authenticate access.

Referring now to FIG. 2, a method 200 of capturing tracking codes in aprinted document is shown according to an exemplary embodiment. Themethod 200 may expand on the details of blocks 160-166 performed at thescanner module level. In block 210, the received document may bescanned. A command to scan may be performed in response to inputreceived at the Printer/MFP panel. A high resolution (for example, 1200dpi) JPEG image may be generated 220. A determination 230 of whetherinvisible information (for example, tracking codes) was present in theJPEG image and captured may be performed. If invisible information wasnot captured, the system may convert 240 the scanner module to black andwhite mode. As will be appreciated, some invisible information such astracking codes is in yellow and may be difficult to capture using aregular scanner due to light color texture. The document may be scannedin black-n-white (monochrome) mode, which will enhance thescanning/capturing quality of content present in the image that may notbe capturable in high resolution color mode. That way the tracking codecan be detected easily as light gray color instead of yellow, which alsoimproves the OCR accuracy. However, if invisible information wasdetected present in the high resolution image, then the generated imagemay be sent 250 to the OCR module for processing the content of theimage including the invisible information.

Referring now to FIG. 3, a multifunction printer system 300 is shownaccording to an exemplary embodiment. The multifunction printer (MFP)system 300 is a device that consolidates the functionality of a printer,copier, scanner and/or fax into one machine. In general, the MFP system300 includes a scanner block 310, a main processing block 320, aprinting block 330, and a user interface (UI) block 340. The scannerblock 310 may perform the steps related to FIG. 2 above. In FIG. 1, thescanner block 310 may be responsible for blocks 160, 162, 172, 174, and178 in FIG. 1. The main processing block 320 may perform many of thedecision and software based operations described above. A processingunit (labeled as “CPU”) may perform many of the action based stepsdisclosed throughout the specification as executable program steps insoftware or firmware resident in the system. Details of the processingunit may seen further below with respect to a general computing device.The printing block 330 may generate hard copies of electronic files intopaper documents. For example, an original document with invisibleinformation, a scanned document with the invisible information extractedfrom the page, or a scanned document with metadata inserted onto thepage, representing extracted information, may be printed out onto asheet of paper. The UI block 340 may include a user interface panel fromwhich a user may press hardware or digitally displayed buttons whichtrigger one or more commands. An exemplary command may includeconfirming the extraction of invisible information from a document (seeblock 172) which triggers the removal of the information beforeconverting the document into a PDF document without the invisibleinformation present.

Referring now to FIG. 4, a scanner system 400 is shown according to anexemplary embodiment. The scanner system 400 may be a high resolutiontype scanner apparatus. Generally, the scanner system 400 may beintegrated into the MFP system 300 or may be a standalone unit connectedto a network. In some embodiments, the methods described in FIG. 1 andFIG. 2 may be performed by the scanner system 400 which may process apaper document for invisible information using embedded software that isadministered by the image processing unit.

FIG. 5 shows a distributed ledger 500 that may be used in conjunctionwith the method 100 disclosed above in FIG. 1. The distributed ledger500 may generally comprise a plurality of computing devices 700, witheach device including a database 730 storing data including copies ofauthenticated documents and metadata which may be referenced duringauthentication processes described with respect to blocks 182-194. Thedistributed ledger 500 may be part of the DMS or CRM system 150. Thedatabase may be consensually shared and synchronized across a networkspread across multiple sites, institutions or geographies. Thedistributed ledger database (which includes the multiple databaserepositories 730 of each computing device 700) is spread across severalnodes (computing devices 700) on a peer-to-peer network, where eachreplicates and saves an identical copy of the ledger and updates itselfindependently. The primary advantage is the lack of central authority.When a ledger update happens, each node constructs the new transaction,and then the nodes vote by consensus algorithm on which copy is correct.Once a consensus has been determined, all the other nodes updatethemselves with the new, correct copy of the ledger. Security isaccomplished through cryptographic keys and signatures.

The distributed ledger 500 may be a blockchain network in someembodiments, which is a ledger that records a historic of transactionsbetween all the users since its creation. It permits the storage andtransmission of information. The database is distributed, which meansthe transactions are validated by a consensus process and shared to allthe participants (Peer-to-peer). It allows counterparties to makesecured direct transactions without a central intermediary. Trust isensured through a mutual consensus verification protocol. The set ofrules and procedures for authenticating and validating the informationare added to the distributed ledger by the members of the network (allthe members or a majority) and not by a central authority.

Referring now to FIG. 6, a method 600 of authenticating access to adocument is shown according to an exemplary embodiment. As will beunderstood, the method 600 may operate in conjunction with (or withaspects of) the embodiments disclosed above. In general, the method 600may be considered a two-part process. The first part is for generating asmart contract with a hash value based on hidden information (forexample, tracking codes) extracted from a document. The second partprovides verification that a party requesting electronic access to adocument has permission to access the document. Proof of access is basedon the requesting party providing a system authorized key that isauthenticated against information which may be stored in a blockchainnetwork.

For example, in block 605, a document may be read for its hiddeninformation. In some embodiments, the hidden information may be one ofthe extracted tracking codes that were embedded in an original documentdescribed above. The hidden information may be metadata. Metadata mayinclude for example, a date and time of the document's creation (or someother timestamp associated with the document). Metadata may also includea serial number assigned to the document. In block 610, encryption logicmay be applied to the information read from block 605 to generate apublic/private key pair associated access to the document. In someembodiments, hash logic may concurrently generate a hash value for thepublic/private key pair in block 615. In block 620, the hash value and apublic key may be used to generate a smart contract for access to thedocument. In some embodiments, the smart contract generation process maybe operated within a blockchain application of a blockchain thatprovides authentication for access to documents. In block 625, the smartcontract may be entered into a blockchain network.

In block 630, a party may request access to a document stored in adatabase. To access the document, a verification process based on theinformation from the tracking code of block 605 may be implemented. Inblock 635, the system may receive a public key generated message fromthe requesting party. In block 640, the public key/private key processmay be retrieved from the blockchain and used to generate a hash valueto authenticate the user's access to the document. In block 645, thesystem may determine whether the hash value generated by the requestingparty matches the hash value to execute the smart contract generated forthe document. If the hash value comparison is a match, then the originaldocument may be accessed by the requesting party in block 655.Otherwise, in block 650, access may be denied if the hash values do notmatch.

Referring now to FIG. 7, a computing device 700 is shown according to anexemplary embodiment. The computing device 700 may be for example, acomputer system or a computer server. In some embodiments, the MFP 110,the MFP 300, and the scanner system 400 may be other examples of acomputing device 700. As will be appreciated, some aspects of theembodiments disclosed above may turn the computing device 700 into aspecial purpose computer system for identifying invisibleinformation/tracking codes and extracting the codes from documents. Forexample, in the role of a host server, the computing device 700 mayimplement for example the functions related to posting blockchainrequests for authentication of documents or for authenticatingauthorized access to a document as described above. In the role of auser device, the computing device 700 is generally not a server but mayinstead be desktop computers, tablet or laptop computers, all-in-onecomputer stations, a mobile computing device (for example, a smartphone, smart wearable devices (glasses, jewelry, watches, ear wear,etc.), or programmable electronics (generally connected to a printer orscanner) to perform the extraction of hidden information/tracking codes.

The components of the computing device 700, may include, but are notlimited to, one or more processors or processing units 710, a systemmemory 720, data storage 730, a computer program product 740 having aset of program modules 745 including files and executable instructions,and a bus system that couples various system components including thesystem memory 720 to the processor(s) 710. The memory storage 720 maystore for example, electronic files of the smart contract criteria andprint advertisement content. In some embodiments, the processors 710 mayinclude a processing unit dedicated to graphics and image processing(for example a graphics processing unit (GPU) or visual processing unit(VPU). As a GPU or VPU, the processor 710 may implement executableinstructions focused on image processing applications either alone or incoordination with other general processor 710 types such a CPUs,microchips, and the like.

The computing device 700 may be described in the general context ofcomputer system executable instructions, such as the program modules 745which represent a software embodiment of the system and processesdescribed generally above with respect to FIGS. 1 and 2. The programmodules 745 generally carry out the functions and/or methodologies ofembodiments as described above. The computing device 700 may typicallyinclude a variety of computer system readable media. Such media could bechosen from any available media that is accessible by the computingdevice 700, including non-transitory, volatile and non-volatile media,removable and non-removable media for use by or in connection with aninstruction execution system, apparatus, or device. A computer readablestorage medium may be, for example, but not limited to, an electronic,magnetic, optical, electromagnetic, infrared, or semiconductor system,apparatus, or device, or any suitable combination of the foregoing. Thesystem memory 720 could include one or more computer system readablemedia in the form of volatile memory, such as a random-access memory(RAM) and/or a cache memory. By way of example only, the data storagesystem 730 may read from and write to a non-removable, non-volatilemagnetic media device. An example of a data storage system is shown as a“hard drive” in the main processing block 320 of FIG. 3. The datastorage system may also sometimes be referred to as a “database” 730 andmay represent an entirely hardware-based storage, an entirelysoftware-based storage, or both. The system memory 720 and/or datastorage system 730 may include at least one program product 740 having aset (e.g., at least one) of program modules 745 that are configured tocarry out the functions of embodiments of the invention in the form ofcomputer executable instructions. The program product/utility 740,having a set (at least one) of program modules 745, may be stored as anoperating system, one or more application programs, other programmodules, and program data. Some embodiments may generate an electronicuser interface (viewable and controllable from the display unit shown inthe UI block 340 of FIG. 3) that may allow the user to enter smartcontract information and post blockchain requests.

The computing device 700 may communicate with one or more externaldevices including for example, an electronic display 750 which may insome embodiments be configured for tactile response as in a touch screendisplay. User input into the display 750 may be registered at theprocessor 710 and processed accordingly. Other devices may enable thecomputing device 700 to communicate with one or more other computingdevices, either by hardwire or wirelessly. Such communication can occurvia Input/Output (I/O) interfaces/ports 760. In some embodiments, theI/O interfaces/ports 760 may be specially configured to handle aspectsof the embodiments described herein converting the computing device 700into a special purpose machine. For example, as a printer, the I/Ointerfaces/ports 760 may be configured to transmit messages related toextraction of information and conversion of documents into documentswithout embedded hidden information. The I/O interfaces/ports 760 mayalso include printing modules (for example, ink jet print heads, laserprint systems, etc.) for rendering print jobs.

The computing device 700, through the I/O interface/ports 760, maycommunicate with one or more networks such as a local area network(LAN), a general wide area network (WAN), and/or a public network (e.g.,the Internet) via a network adapter as is commonly known in the art.Each of the operating system, one or more application programs, otherprogram modules, and program data or some combination thereof, mayinclude an implementation of a networking environment. In someembodiments, the computing device 700 may be a cloud computing nodeconnected to a cloud computing network which may be for examplerepresented by the network shown in FIG. 5. The computing device 700 maybe practiced in distributed cloud computing environments where tasks areperformed by remote processing devices that are linked through acommunications network. In a distributed cloud computing environment,program modules may be located in both local and remote computer systemstorage media including memory storage devices. In an exemplaryembodiment, the blockchain 500 described above may be one form of adistributed cloud computing environment.

As will be appreciated by one skilled in the art, aspects of thedisclosed invention may be embodied as a system, method or process, orcomputer program product. Accordingly, aspects of the disclosedinvention may take the form of an entirely hardware embodiment, anentirely software embodiment (including firmware, resident software,micro-code, etc.) or an embodiment combining software and hardwareaspects that may all generally be referred to herein as a “module”,“circuit”, or “system.” Furthermore, aspects of the disclosed inventionmay take the form of a computer program product embodied in one or morecomputer readable media having computer readable program code embodiedthereon. In some embodiments, the output of the computer program productprovides an electronic user interface on the display 750 which may becontrolled via direct contact with the display 750 or via the I/Ointerfaces 760 (which may be for example, interface devices such askeyboards, touchpads, a mouse, a stylus, or the like).

Aspects of the disclosed invention are described above with reference toblock diagrams of methods, apparatus (systems) and computer programproducts according to embodiments of the invention. It will beunderstood that each block of the block diagrams, and combinations ofblocks in the flowchart illustrations and/or block diagrams, can beimplemented by computer program instructions. These computer programinstructions may be provided to the processor 710 of a general-purposecomputer, special purpose computer, or other programmable dataprocessing apparatus to produce a machine, such that the instructions,which execute via the processor of the computer or other programmabledata processing apparatus, create means for implementing thefunctions/acts specified in the flowchart and/or block diagram block orblocks in the figures.

Those of skill in the art would appreciate that various components andblocks may be arranged differently (e.g., arranged in a different order,or partitioned in a different way) all without departing from the scopeof the subject technology. The previous description is provided toenable any person skilled in the art to practice the various aspectsdescribed herein. The previous description provides various examples ofthe subject technology, and the subject technology is not limited tothese examples. Various modifications to these aspects will be readilyapparent to those skilled in the art, and the generic principles definedherein may be applied to other aspects.

Thus, the claims are not intended to be limited to the aspects shownherein, but are to be accorded the full scope consistent with thelanguage of claims, wherein reference to an element in the singular isnot intended to mean “one and only one” unless specifically so stated,but rather “one or more.” Unless specifically stated otherwise, the term“some” refers to one or more. Pronouns in the masculine (e.g., his)include the feminine and neuter gender (e.g., her and its) and viceversa. Headings and subheadings, if any, are used for convenience onlyand do not limit the invention.

A phrase such as an “aspect” does not imply that such aspect isessential to the subject technology or that such aspect applies to allconfigurations of the subject technology. A disclosure relating to anaspect may apply to all configurations, or one or more configurations.An aspect may provide one or more examples. A phrase such as an aspectmay refer to one or more aspects and vice versa. A phrase such as an“embodiment” does not imply that such embodiment is essential to thesubject technology or that such embodiment applies to all configurationsof the subject technology. A disclosure relating to an embodiment mayapply to all embodiments, or one or more embodiments. An embodiment mayprovide one or more examples. A phrase such an embodiment may refer toone or more embodiments and vice versa. A phrase such as a“configuration” does not imply that such configuration is essential tothe subject technology or that such configuration applies to allconfigurations of the subject technology. A disclosure relating to aconfiguration may apply to all configurations, or one or moreconfigurations. A configuration may provide one or more examples. Aphrase such a configuration may refer to one or more configurations andvice versa.

The word “exemplary” is used herein to mean “serving as an example orillustration.” Any aspect or design described herein as “exemplary” isnot necessarily to be construed as preferred or advantageous over otheraspects or designs.

All structural and functional equivalents to the elements of the variousaspects described throughout this disclosure that are known or latercome to be known to those of ordinary skill in the art are expresslyincorporated herein by reference and are intended to be encompassed bythe claims. Moreover, nothing disclosed herein is intended to bededicated to the public regardless of whether such disclosure isexplicitly recited in the claims. No claim element is to be construedunder the provisions of 35 U.S.C. § 112, sixth paragraph, unless theelement is expressly recited using the phrase “means for” or, in thecase of a method claim, the element is recited using the phrase “stepfor.” Furthermore, to the extent that the term “include,” “have,” or thelike is used in the description or the claims, such term is intended tobe inclusive in a manner similar to the term “comprise” as “comprise” isinterpreted when employed as a transitional word in a claim.

What is claimed is:
 1. A method of protecting the authenticity ofdocuments, comprising: receiving a document, including invisible printedinformation, by an optical scanning device; scanning the document;capturing an image of content in the scanned document; determiningwhether the invisible printed information was present in the capturedimage of content in the scanned document; in response to a determinationthat the invisible printed information was not present, converting to ablack and white mode of scanning; identifying the invisible printedinformation in the captured image of the scanned document in the blackand white mode of scanning; extracting the invisible printed informationfrom the captured image of the scanned document; generating metadatarepresenting the identified invisible printed information; convertingthe captured image of the scanned document into an electronic copy ofthe received document, without the invisible printed information beingpresent in the electronic copy; and attaching the generated metadata tothe converted electronic copy of the received document.
 2. The method ofclaim 1, wherein the generated metadata is embedded into the convertedelectronic copy of the received document.
 3. The method of claim 1,wherein the generated metadata is text visible in the convertedelectronic copy of the received document.
 4. The method of claim 1,wherein the generated metadata is at least one of encrypted text andencrypted symbols.
 5. The method of claim 4, wherein the electronic copyof the received document includes a copy of the generated metadataincluding the at least one of encrypted text and encrypted symbols; andthe method further comprises providing the electronic copy of thereceived document including the at least one of encrypted text andencrypted symbols as an authenticated copy of the received document,provided through an encrypted transaction.
 6. The method of claim 1,wherein the invisible printed information is tracking code information.7. The method of claim 6, wherein the tracking code information includesa timestamp and serial code associated with the received document.
 8. Acomputer program product to protect the authenticity of documents, thecomputer program product comprising a non-transitory computer readablestorage medium having computer readable program code embodied therewith,the computer readable program code being configured, when executed by aprocessing unit, to: receive a document, including invisible printedinformation, by an optical scanning device; scan the document; capturean image of content in the scanned document; determine whether theinvisible printed information was present in the captured image ofcontent in the scanned document; in response to a determination that theinvisible printed information was not present, convert to a black andwhite mode of scanning; identify the invisible printed information inthe captured image of the scanned document in the black and white modeof scanning; generate metadata representing the identified invisibleprinted information; extract the invisible printed information from thecaptured image of the scanned document; convert the captured image ofthe scanned document into an electronic copy of the received document,without the invisible printed information being present in theelectronic copy; and attach the generated metadata to the convertedelectronic copy of the received document.
 9. The computer programproduct of claim 8, wherein the generated metadata is embedded into theconverted electronic copy of the received document.
 10. The computerprogram product of claim 8, wherein the generated metadata is textvisible in the converted electronic copy of the received document. 11.The computer program product of claim 8, wherein the generated metadatais at least one of encrypted text and encrypted symbols.
 12. Thecomputer program product of claim 11, wherein the electronic copy of thereceived document includes a copy of the generated metadata includingthe at least one of encrypted text and encrypted symbols; and thecomputer programmable code is further configured to, provide theelectronic copy of the received document including the at least one ofencrypted text and encrypted symbols as an authenticated copy of thereceived document, provided through an encrypted transaction.
 13. Thecomputer program product of claim 8, wherein the invisible printedinformation is tracking code information.
 14. The computer programproduct of claim 13, wherein the tracking code information includes atimestamp and serial code associated with the received document.
 15. Acomputing apparatus, comprising: an optical scanner module; a memorystorage unit; and a processing unit coupled to the optical scannermodule and to the memory storage unit, wherein computer readable programcode in the memory storage unit is configured to protect theauthenticity of documents, by controlling the processing unit to:receive a document, including invisible printed information, by theoptical scanner module; scan the document; capture an image of contentin the scanned document; determine whether the invisible printedinformation was present in the captured image of content in the scanneddocument; in response to a determination that the invisible printedinformation was not present, convert to a black and white mode ofscanning; identify the invisible printed information in the capturedimage of the scanned document in the black and white mode of scanning;generate metadata representing the identified invisible printedinformation; extract the invisible printed information from the capturedimage of the scanned document; convert the captured image of the scanneddocument into an electronic copy of the received document, without theinvisible printed information being present in the electronic copy; andattach the generated metadata to the converted electronic copy of thereceived document.
 16. The computing apparatus of claim 15, wherein thegenerated metadata is embedded into the converted electronic copy of thereceived document.
 17. The computing apparatus of claim 15, wherein thegenerated metadata is text visible in the converted electronic copy ofthe received document.
 18. The computing apparatus of claim 15, whereinthe generated metadata is at least one of encrypted text and encryptedsymbols.
 19. The computing apparatus of claim 18, wherein the electroniccopy of the received document includes a copy of the generated metadataincluding the at least one of encrypted text and encrypted symbols; andthe processing unit is further configured to, provide the electroniccopy of the received document including the at least one of encryptedtext and encrypted symbols as an authenticated copy of the receiveddocument, provided through an encrypted transaction.
 20. The computingapparatus of claim 15, wherein the invisible printed information istracking code information including a timestamp and serial codeassociated with the received document.